BitPath – Label Order Constrained Reachability in Large Graphs
نویسندگان
چکیده
With ever expanding size of the web, sizes of graphs with semantic relationships are growing very rapidly. The semantic relationships are typically presented as edge-labeled graphs where nodes are the entities and the edge label signifies the relationships between the two entities. RDF graphs are an example of such graphs. Pure reachability queries on these graphs do not give any information about the semantic association between two remote entities. On the other hand, the typical size of semantic graphs present on the web has made general regular path query processing infeasible. In this paper we focus on the following graph path query problem – given a source node s, a destination node d and a sequence of ordered edge labels “label seq”, does there exist any path between the two nodes which has the edges labels in “label seq” (in the given order) on the path? We solve this problem by a combination of graph indexing, and a query processing algorithm based on a divide-and-conquer procedure and greedy pruning of the search space. We have evaluated our technique on graphs with more than 22 million edges and 6 million nodes – much larger compared to the datasets used in published work on path queries. We compare our approach with optimized-DFS, optimized-focusedDFS, and bidirectional-BFS methods.
منابع مشابه
BitPath – Label Order Constrained Reachability Queries for Large RDF Graphs
In this paper we focus on the following constrained reachability problem over edge-labeled graphs like RDF – given source node x, destination node y, and a sequence of edge labels (a, b, c, d), is there a path between the two nodes such that the edge labels on the path satisfy a regular expression “*a.*b.*c.*d.*”. A “*” before “a” allows any other edge label to appear on the path before edge “a...
متن کاملBitPath -- Label Order Constrained Reachability Queries over Large Graphs
In this paper we focus on the following constrained reachability problem over edge-labeled graphs like RDF – given source node x, destination node y, and a sequence of edge labels (a, b, c, d), is there a path between the two nodes such that the edge labels on the path satisfy a regular expression “*a.*b.*c.*d.*”. A “*” before “a” allows any other edge label to appear on the path before edge “a...
متن کاملEfficient processing of label-constraint reachability queries in large graphs
In this paper, we study a variant of reachability queries, called label-constraint reachability (LCR) queries. Specifically, given a label set S and two vertices u1 and u2 in a large directed graph G, we check the existence of a directed path from u1 to u2, where edge labels along the path are a subset of S. We propose the path-label transitive closure method to answer ted DAG by replacing the ...
متن کاملLearning in Probabilistic Graphs Exploiting Language-Constrained Patterns
The probabilistic graphs framework models the uncertainty inherent in real-world domains by means of probabilistic edges whose value quantifies the likelihood of the edge existence or the strength of the link it represents. The goal of this paper is to provide a learning method to compute the most likely relationship between two nodes in a framework based on probabilistic graphs. In particular,...
متن کاملConstrained Route Distribution for Border Gateway Protocol/multiprotocol Label Switching (bgp/mpls) Internet Protcol (ip) Virtual Private Networks (vpns)
This document defines Multi-Protocol BGP (MP-BGP) procedures that allow BGP speakers to exchange Route Target reachability information. This information can be used to build a route distribution graph in order to limit the propagation of Virtual Private Network (VPN) Network Layer Reachability Information (NLRI) between different autonomous systems or distinct clusters of the same autonomous sy...
متن کامل